Two decades of fumigation data from the Soybean Free Air Concentration Enrichment facility

The Soybean Free Air Concentration Enrichment (SoyFACE) facility is the longest running open-air carbon dioxide and ozone enrichment facility in the world. For over two decades, soybean, maize, and other crops have been exposed to the elevated carbon dioxide and ozone concentrations anticipated for late this century. The facility, located in East Central Illinois, USA, exposes crops to different atmospheric concentrations in replicated octagonal ~280 m2 Free Air Concentration Enrichment (FACE) treatment plots. Each FACE plot is paired with an untreated control (ambient) plot. The experiment provides important ground truth data for predicting future crop productivity. Fumigation data from SoyFACE were collected every four seconds throughout each growing season for over two decades. Here, we organize, quality control, and collate 20 years of data to facilitate trend analysis and crop modeling efforts. This paper provides the rationale for and a description of the SoyFACE experiments, along with a summary of the fumigation data and collation process, weather and ambient data collection procedures, and explanations of air pollution metrics and calculations.


Background & Summary
Prior to the Industrial Revolution, the concentration of carbon dioxide ([CO 2 ]) in the atmosphere did not exceed 280 parts per million (ppm) for at least 800,000 years 1 . The global monthly mean for ambient [CO 2 ] was 416 ppm in October of 2022, a ~50% increase since industrialization and a ~20% increase since 1980 [https:// gml.noaa.gov/ccgg/trends/global.html]. Atmospheric [CO 2 ] is continuing to rise at an unprecedented rate and without significant emissions reductions, the concentrations will reach 700 to 1100 ppm by 2100 2 . In addition to CO 2 , the tropospheric concentration of ozone ([O 3 ]) has also increased since the Industrial Revolution. Although O 3 in the stratosphere, which is about eight kilometers above the Earth's surface, provides a useful barrier for ultraviolet radiation, O 3 in the surface tropospheric layer is a toxic pollutant. The ~40% increase in tropospheric [O 3 ] since industrialization 3 has caused harm to humans, animals, and plants 4 . These increases in CO 2 and O 3 concentrations have already directly impacted plants 5,6 , and future atmospheric increases will only intensify impacts on agriculture.
For over a century, scientists have tried to understand the effects of atmospheric change on agriculture [6][7][8][9][10] . To this end, researchers have experimented with different approaches to alter greenhouse gas concentrations around plants. Controlled environmental enclosures, including growth chambers, allow for relatively precise control of light, humidity, temperature, and atmospheric composition, enabling scientists to mechanistically test how CO 2 or O 3 affect plant function while all other conditions are held constant. However, growth chambers are typically small which limits the number of individual plants that can be examined, and there are often inconspicuous differences in supposedly identical chambers 11 . Also, in addition to enriching the air with CO 2 and O 3 , chamber experiments can have unwanted impacts on other aspects of the environment surrounding the plant 12 . For example, plants in chamber studies are grown in pots which can restrict root growth, altering the plant response to elevated [CO 2 ] 13,14 . Greenhouses and naturally sunlit outdoor growth chambers are an alternative option to indoor chambers and can be used to study plants in natural environments 9 . Because the sides of outdoor chambers typically are made of acrylic plastic, they can partially block radiation and potentially increase temperature and humidity. Outdoor chambers can also increase [CO 2 ] at the plant canopy level if the fumigation is released from the bottom of the chamber 9 . Additionally, open-top chambers (OTCs) shelter vegetation from the wind and force air upwards through the canopy, which alters the natural atmospheric coupling 12 . Thus, although outdoor chambers use aspects of the natural environment to operate, they can also modify the surrounding environment in undesirable ways similar to indoor chambers. These environmental and atmospheric impacts make the results of chamber studies less comparable to the natural response of plants to climate change, which is a clear limitation of both indoor and outdoor growth chambers.
Free Air Concentration Enrichment (FACE) facilities were developed as a 'real-world' approach to understanding how plants respond to altered atmospheres in fully open-air conditions 9,10,15,16 . An important benefit of the FACE approach is the ability to study the interaction of multiple atmospheric variables with elevated [CO 2 ] and [O 3 ], and the effects of these interactions on plants in nature 17 . Unlike growth chambers which are restricted to only 1 to 15 m 2 in size, FACE plots can range in size from 100 to 300 m 2 , allowing for experiments that are larger in scope and more varied than OTC experiments 18 . A typical FACE plot consists of a circular or octagonal array of pipes that release CO 2 or air enriched with CO 2 or O 3 at the canopy surface for small stature vegetation, or at varying levels from the ground to the top of the canopy for larger stature vegetation 10 [29][30][31] . More recently, the facility has been used to study genetic variation in crop responses to atmospheric change [32][33][34] and genetically engineered adaptation to rising [CO 2 ] and temperature 35,36 3 ] data are collected every four seconds along with the wind speed and wind direction. These data are averaged to produce 1-minute fumigation 37 (File 9) and ambient data files, which can be used to calculate fumigation efficiency statistics and hourly, daily, monthly, and seasonal fumigation and ambient data metrics. As the longest running FACE facility in the world, it is fitting that we provide broad access of our data to the scientific community. The abundance of fumigation data from experiments executed over two decades make the SoyFACE fumigation data particularly useful for modeling the impacts of rising CO 2 and O 3 on crop physiology and agronomy, as well as ecosystem function.

Methods
Field site & experimental design. The 32 ha SoyFACE farm is located on the south side of the University of Illinois Urbana-Champaign campus (228 meters above sea level; 40° 02′ North, 88° 14′ West). Each year, approximately half of the field is planted with soybean (Glycine max) and half with maize (Zea mays). The maize crop is fertilized with ~200 kg N ha −1 and the soybean crop is not fertilized. Lime has been periodically applied to the crops over the past 20 years, with pre-and post-emergent herbicides applied to the crops per common practice in the region. The two predominant soil types at the farm are Drummer silty clay loam and Flanagan silt loam. Within the 32 ha field, there are fixed locations for 32 octagonal plots. Soybean and maize are each grown over roughly half of the facility in a given year, and rotate positions in successive years. Each FACE plot is 20 meters in diameter with an area of ~280 m 2 (Fig. 1a,b). In a given year, experiments within a specific crop have used 8-16 plots in randomized complete block designs. Most experiments have been designed with replication at the plot level of n = 4 (Tables 1, 2). In some cases, split-plot treatments of drought or temperature have also been applied 27,38 . The crops are typically planted between mid-May and mid-June, with fumigation starting within two weeks of the planting date and crop harvests occurring in September and October. are measured from the center of each plot (Fig. 1b) and sent to a computer control system in the field, which inputs the measurements into the Proportional Integral Differential (PID) algorithm (Eq. 1). The PID algorithm is a commonly used control process that uses key sensor inputs and setpoints to calculate the output variable. The SoyFACE PID algorithm includes a wind speed component that is not found in other PID algorithms and was developed by Lewin et al. 39 Table 2 (2011 through 2021). The process of adjusting the CO 2 and O 3 levels is inexact since it is impossible to predict www.nature.com/scientificdata www.nature.com/scientificdata/ exactly which valve setting should be used to add a specific amount of gas to each plot. However, by using the PID algorithm at frequent intervals, the fumigation system is able to use the valve settings to quickly correct gas level adjustments that are above or below the desired amount. The goal of the SoyFACE system is to maintain the gas levels as closely as possible to the setpoints by continually measuring the current levels, implementing the PID algorithm, and releasing additional gases into the plots as needed.  www.nature.com/scientificdata www.nature.com/scientificdata/ 5. K p : Proportional coefficient; indicates that the control variable should be adjusted proportionally to the error in the system, or how much the process variable differs from the setpoint: error = (G stpt -G pv ). 6. K i : Integral coefficient. 7. K d : Differential coefficient. 8. K w : Wind coefficient. 9. Int(): Integral function; measures the accumulation of the error over time. 10. Der(): Differential function; compensates for sudden changes in the error.
Coefficients K p , K i , K d , and K w are constant values determined through analysis of iterative test runs of the fumigation process in early SoyFACE experiments. Currently the coefficients are as follows: The fumigation system at the SoyFACE farm was initially designed to distribute CO 2 to the treatment plots. The early SoyFACE model was based upon the system used at a poplar FACE plantation (POPFACE) in Tuscania, Italy. This fumigation system was designed to release pure CO 2 into the atmosphere at a high velocity through a large number of small regularly spaced air jets to create a shock wave and turbulence at the gas exit point. This enhanced the mixing of CO 2 into the ambient air and since the jets face outward, the CO 2 is mixed www.nature.com/scientificdata www.nature.com/scientificdata/ with the air prior to being carried by the wind back into the plot. This provides a relatively uniform elevated [CO 2 ] within the plot 16 .
In addition to the air jet configuration, determining the ideal pressure settings for each step of the CO 2 fumigation process was critical. To this end, a manually controlled pressure generator with narrow perforations was used to modify the pressure of the CO 2 gas flow within the pipes in eight steps, from 0.15 MPa to 0.45 MPa. The ability to change the air pressure allowed for better regulation of the CO 2 flow rate as the gas was transported through an underground HDPE pipeline to horizontal pipes along the sides of the octagonal plots. Additionally, the voltage calculated by the PID algorithm regulated the pressure inside the horizontal pipes and controlled the amount of CO 2 entering the plots. Following the release of CO 2 , natural wind currents facilitated the distribution of CO 2 throughout each plot.
The CO 2 fumigation system at SoyFACE retains much of the same design of the original POPFACE model. Liquid CO 2 is stored in a 50-ton vertical tank at the SoyFACE facility and passed through vaporizing equipment to produce gaseous CO 2 , which is delivered to specific locations in the field through underground pipes. CO 2 is transported to a manifold (gas delivery system; Fig. 1c) outside of the plot and tubing delivers the CO 2 to the tubes surrounding the treatment plots. CO 2 is released through a linear flow valve (SMC pressure controllers 16 ) and pure CO 2 is released into the wind through 350 or 500 small air jets placed 15 mm apart and drilled into 8-meter-long pipes that surround the SoyFACE plots (Fig. 1). The flow valves have settings between 0 and 10, with a setting of 0 indicating a completely closed valve and a setting of 10 indicating a completely open valve. The control computer system (formerly Z-World Inc. Model SR9000; currently Rabbit BLS4200 series) calculates the amount of gas that should be released into each plot based on information from wind sensors (R.M. Young Model 12005), CO 2 analyzers (PP Systems SBA series), and the PID algorithm. The high jet velocity of the CO 2 gas stream creates rapid dilution with the ambient air 17 . Throughout the past 20 years, as seen in Fig. 2a,b, the mean wind speed varied slightly over the site, and was 2.0 and 1.7 m/s in Plot 14 and Plot 3, respectively. These two plots exemplify the variation in wind speed and direction measured at the site and were used in most   (Tables 1, 2). In the instances when the wind speeds dropped below 0.2 m/s, the CO 2 fumigation system cycled CO 2 gas around the plots to maintain the setpoint as closely as possible.
Testing of different aspects of the CO 2 fumigation system at POPFACE and other FACE sites assisted with the development of the CO 2 fumigation process used at SoyFACE. However, CO 2 was not the only greenhouse gas of interest, and the fumigation system was modified to deliver O 3 treatments in addition to CO 2 . In preparation of the O 3 fumigation process, liquid oxygen is stored in a 450-liter cryogenic tank at SoyFACE. The liquid O 2 is passed through vaporizing equipment to produce gaseous O 2 . Then, a generator is used to pass gaseous O 2 through a high-voltage dielectric field (~6000 volts) inside a generator (PCI-Wedeco Model GA40 prior to 2005; currently Ozonia Ozat Model CFS-3 2 G). The dielectric field forces some of the O 2 molecules to disassociate and recombine to form O 3 , producing up to 3.5 kg of O 3 per day. Due to the toxicity of O 3 and the fact that the generator produces the gas at a low pressure, it must be pressurized and mixed with compressed air before being transported to the SoyFACE plots. This is accomplished with a bypass venturi differential pressure injector (Mazzei Injectors Model 384-X), which forces O 3 to mix into the higher-pressure air stream. The compressed air stream enters the bypass pressure injector at 90 PSI, O 3 enters the pressure injector at a low pressure of 8 PSI, and the resulting O 3 -air mixture has a pressure of 35 PSI. The change in pressure allows the O 3 gas mixture to be delivered from the pressure injector to the manifold. Computer-controlled linear flow valves (PCI-Wedeco Model GA40 prior to 2005; currently Teledyne Hastings Model HCF-302) control the release of O 3 -enriched air into the wind. The concentration of O 3 in the center of each octagonal plot is monitored with an O 3 analyzer (Thermo Fisher Scientific Model 49 C/49I), and that information is used to control the setpoint with the PID algorithm as described for CO 2 fumigation.
The gas concentrations and wind data are transmitted to a central computer located in an onsite trailer for general data storage and performance analysis. The control computer uses the wind direction measurements to control which main sector of the octagonal treatment plot releases CO 2 or O 3 -enriched air, with the two neighboring octagon sectors releasing a smaller amount of CO 2 or O 3 . Since the three fumigation entry sectors are most directly upwind, following the high-pressure valve release, the gases distribute evenly throughout the octagon and dilute to the background gas concentrations within ~100 meters of the plot.  48 . The radii length of the concentric circles represents the percentage frequency of measurements that have wind speeds between 0 to 2 m/s (blue), 2 to 4 m/s (green), 4 to 6 m/s (orange), and >6 m/s (red). The wind direction ranges include plus or minus 15 degrees from the given direction, starting at 0° on the upper vertical axis (N) and moving clockwise in 30° increments back to 360°. In the lower right corner of the plots, the mean refers to the overall mean wind speed for the plot, and the calm percentage indicates the percentage of calm observations omitted from the wind rose plot. Following the convention of the National Weather Service, winds with a direction of 0° are considered 'calm' , while winds with a direction of 360° are assumed to be from the north. www.nature.com/scientificdata www.nature.com/scientificdata/ Data collection & processing methods. Wind serves a crucial role in the fumigation experiments by dispersing CO 2 and O 3 throughout each treatment plot, and therefore accurate wind data are important. To this end, wind data for each plot are recorded at 4-second intervals along with the CO 2 and O 3 fumigation levels, setpoint gas levels, and flow valve settings. In particular the wind direction data have been analyzed at SoyFACE, with the prevailing wind direction at the experiment site found to be South/Southwest with some variation between treatment plots due to the difference in location (Figs. 2, 3). The central computer at SoyFACE receives the 4-second and 1-minute fumigation data files (which are averaged from the 4-second data files), and stores both sets of data. Occasionally, there have been data losses and errors at SoyFACE due to extreme weather or technological issues with the analyzers or computer systems. The use of customized Matlab computer code and functions (detailed in the Matlab Files sub-section of the Data Records section) allows the identification of gas measurements that are outside the expected threshold, or 'filter window' , along with possibly erroneous repeated values. The filter window for [CO 2 ] was determined to be 250-1500 ppm, while the filter window for [O 3 ] was 0-500 parts per billion (ppb). After the processing and quality control of the 1-minute fumigation data files, the fumigation measurements were averaged (i.e., the mean and median values were calculated from the 60 1-minute measurements over each hour) via computer code to produce hourly fumigation files 37 (File 11). The hourly fumigation files have proven to be useful, as they are more commonly used in plant growth models than the 4-second and 1-minute fumigation data files. Ambient [CO 2 ] and [O 3 ] are measured at a central location in the SoyFACE field and stored in 1-minute ambient files in the central computer, which have also been consolidated into hourly files.
Precipitation and solar radiation data were recorded at the Water and Atmospheric Resources Monitoring (WARM) station in Champaign (https://warm.isws.illinois.edu/warm/), while other weather metrics such as temperature, wind speed, and relative humidity were recorded at the Surface Radiation (SURFRAD) station about 16 km southwest of the SoyFACE farm (https://gml.noaa.gov/grad/surfrad/bondvill.html). Collectively, the WARM and SURFRAD meteorological data are referred to as CMI weather data due to the stations' proximity to the University of Illinois-Willard Airport (CMI). Computer code that is publicly available on GitHub (https://github.com/eloch216/oscillator-based-circadian-clock-analysis/) can be used to process and consolidate Additional useful metrics include the O 3 exposure indices AOT40, SUM06, and W126 (Eqs. [2][3][4], which are included in the 'SoyFACE Hourly Fumigation Data' files. In these equations, i = 1, …, n represent the daylight hours between 8:00 AM and 7:00 PM. The O 3 exposure indices were measured in parts per billion initially and then converted to parts per million. Unlike the AOT40 and SUM06 indices, which ignore all O 3 concentrations below a certain threshold, the W126 index is a sigmoidal weighted function that gives preferential treatment to higher concentrations of O 3 up to 100 ppb (0.1 ppm) without ignoring the lower concentrations 40  Another important metric that can be computed from the SoyFACE experimental data is the fumigation target percentages (Table 3). This metric calculates the proportion of minutes that the CO 2 or O 3 level is within 10% or 20% of the setpoint when the fumigation system is turned on, which provides insight into the efficiency and accuracy of the SoyFACE experiments. These data can also be found in the 'Fumigation Target Percentages' file; the code that generates the data is described in the supplementary explanatory file 37 (Files 2-3).

Data Records
The data records cited in this work are stored in the Illinois Data Bank, which is a public access repository for publishing research data from the University of Illinois at Urbana-Champaign (https://databank.illinois.edu/). This data set consists of 8 files and 4 zipped folders 37 . Descriptions of these data records are as follows.  Table 3. Fumigation efficiency data for SoyFACE experiments between 2001 and 2021. Percentages are calculated by dividing the total amount of minutes that the fumigation system is turned on and the CO 2 /O 3 measurement is within 10% or 20% of the setpoint by the total amount of minutes that the fumigation system is turned on. Note that overall, the CO 2 fumigation process is more precise than the O 3 fumigation process, maintaining 20% accuracy at least 80% and 2021, including the fumigation treatment type (CO 2 , O 3 , or a combination treatment), crop species, the plot (also referred to as 'ring') numbers used for each experiment, planting, treatment, and harvesting dates, and the gas setpoints 37 (File 12). The full data are also contained in Tables 1-2. SoyFACE 1-minute fumigation data files. The raw fumigation data at SoyFACE are initially recorded as 4-second data files, and subsequently averaged to create 1-minute data files. The 1-minute raw data have been quality controlled to remove erroneous repeated values (Data_Issues_Finder custom code 37 (Files 1 and 9).
The quality controlled 1-minute data files are named ' Avg_MMDDYY' and contained in the 'SoyFACE 1-Minute Fumigation Data Files' folder. The 'SoyFACE 1-Minute Fumigation Data Explanation' file contains the column descriptions, units of measurement, and other important notes 37 (File 8).
SoyFACE hourly fumigation data files. The hourly SoyFACE fumigation files are generated by averaging the CO 2 and O 3 fumigation data from the quality controlled 1-minute data files, ignoring values outside the filter window as described in the Data Collection & Processing Methods sub-section. The hourly fumigation files also include ozone exposure metrics AOT40, SUM06, and W126.
The hourly fumigation files are named 'YYYY_HrlyFumData_ByRing' and contained in the 'SoyFACE Hourly Fumigation Data Files' folder. The 'SoyFACE Hourly Fumigation Data Explanation' file contains the formulas for ozone exposure indices AOT40, SUM06, and W126, details about the custom code used to create the hourly fumigation files, and column descriptions for the files 37 (File 10).
Hourly weather and ambient data files. The 1-minute ambient CO 2 and O 3 data are used to generate hourly ambient data files using the same methods that generate the hourly fumigation data files. The hourly weather data collected from the WARM and SURFRAD stations are combined with the hourly ambient SoyFACE data into single files for each year of the experiments.
The hourly weather and ambient data files are named 'YYYY_HrlyWeatherData' and contained in the 'Hourly Weather and Ambient Data Files' folder. The 'Hourly Weather and Ambient Data Explanation' file contains the column descriptions, units of measurement, and other important notes 37 (File 4).
Fumigation target percentages file. The target fumigation percentages file shows the proportion of minutes during each growing season that the fumigation CO 2 and O 3 levels are within 10% and 20% of the target concentrations (setpoints) for the SoyFACE experiment when the fumigation system is turned on. The 'Fumigation Target Percentages Explanation' file contains details about the custom code used to create the 'Fumigation Target Percentages' file, and column descriptions for the file 37 (File 2). The full data from this file are also contained in Table 3.

Matlab files.
There are several custom Matlab files 37 (File 7) that were created to process and quality control the 'SoyFACE 1-Minute Fumigation Data' files, and to generate the 'SoyFACE Hourly Fumigation Data' and 'Fumigation Target Percentages' files, as enumerated below: 1. rings_for_year: The rings_for_year function takes a specific year as user input and generates a list of the rings (plots) used in that year's SoyFACE experiments as the output variable. 2. Data_Issues_Finder: The rings_for_year function must be run prior to running this code; the user inputs a specific year into that function, and the output is stored as a variable. This output variable is then used as an input for the Data_Issues_Finder code, which loops through the SoyFACE 1-minute raw data files for the year and identifies fumigation measurements that are potentially erroneous by keeping a record of all values that are repeated from one minute to the next. Once the output file has been generated, the user must use qualitative analysis to determine which fumigation measurements are actually erroneous (by comparing the repeated fumigation values to the ambient CO 2 and O 3 concentrations, considering the number of repeats in a row, etc.). Usage details can be found in the 'Data_Issues_Finder Code Explanation' file 37 (File 1). 3. fum: The fum function stores user input details about a specific fumigation experiment as variables, which can then be accessed by other Matlab functions. 4. batch: The batch code allows the user to run the HourlyDataFunction function in bulk, for all dates within the growing period (5/1 through 10/15). 5. HourlyDataFunction: The HourlyDataFunction function takes the output from the fum function as input, along with additional user-provided input. The function uses these inputs to generate output variables such as the file names of the 1-minute fumigation data files, and also calls the HourlyData code so that it does not need to be run separately. 6. HourlyData: The HourlyData code generates the hourly mean and median fumigation metrics from the quality controlled SoyFACE

technical Validation
Technical validation of the fumigation data set was achieved by regular maintenance and calibration of equipment. CO 2 analyzers were calibrated before the start of each growing season and regularly throughout the season using certified gases from an ISO/IEC 17025:2017 accredited source per the manufacturer's instruction. CO 2 calibration is at a single point, 750 ppm, and a verification of 0 ppm. The analyzers self-zero approximately every hour. Ozone analyzers were calibrated before the start of the season with a Thermo Scientific 49 C PS or Thermo Scientific 49i PS ozone transfer standard. The standard was either verified by the United States Environmental Protection Agency as a 'Level 2' standard or verified by Illinois Environmental Protection Agency as a 'Level 3' standard 41 . The US EPA ozone verification program is part of a larger program managed by the National Institute for Standard and Technology.
Ozone analyzers were calibrated during the season, using linear regression at 0 ppb, 50 ppb, 100 ppb, 150 ppb, and 200 ppb. Ozone generators, compressors, and pressure regulators were serviced each field season according to their manuals, and parts were replaced as needed. Data from each of the fumigation plots were compared to test for outliers and the need to calibrate sensors. Wind sensors were maintained each field season and calibrated as needed. The custom software system alerted the FACE site managers of communication and electrical problems.
The fumigation target percentages for each year of the SoyFACE experiments between 2001 and 2021 provide a quantitative measurement of the accuracy of the fumigation process at SoyFACE (Table 3). In particular, Table 3 shows that the CO 2 fumigation process maintained 20% accuracy (i.e., the measured CO 2 value was within 20% of the setpoint) for at least 90% of the time for 17 out of 21 years of CO 2 experiments and maintained 20% accuracy at least 80% of the time for all years. The O 3 fumigation process maintained 20% accuracy at least 90% of the time for 6 out of 20 years of O 3 experiments, at least 80% of the time for 17 out of 20 years, and at least 70% of the time for all years. It is clear that the CO 2 fumigation process is able to achieve greater precision than the O 3 fumigation process, likely because the target elevated concentration for CO 2 was an approximate 50% increase over ambient, whereas the elevated O 3 concentration in recent years was a 150% increase over ambient.

Usage Notes
This SoyFACE fumigation data set can be used as an input for studies that aim to model the impacts of atmospheric change on crop productivity at field, landscape, or regional scales [42][43][44][45] . For example, a recent semi-mechanistic model of soybean biochemistry and growth was developed using data from a few years of the SoyFACE experiment 45 . Having 20 years of SoyFACE data compiled and accessible could enhance the development and testing of such a model. Jin et al. 44 investigated the interaction of rising [CO 2 ] and drought stress on regional soybean production, again using only a few years of data from SoyFACE to parametrize the CO 2 response. Having the full set of fumigation data from SoyFACE could improve scenario simulations, yielding more useful results. The physiological and agronomic data describing crop responses to elevated [CO 2 ] and [O 3 ] at SoyFACE have been previously published in both original manuscripts 25,31,32 and meta-analyses 26,46 and are available as supplemental files in those studies. Here, for the first time, we provide the complete hourly and seasonal fumigation information for 20 years of SoyFACE experiments.
The fumigation data set can also be used to explicitly test how wind speed, wind direction, and other environmental factors impact the precision and efficiency of fumigation. Recent studies have hypothesized that rapid fluctuations in CO 2 concentration in FACE experiments may reduce the photosynthetic, growth, and yield response of crops to elevated CO 2 concentrations 47 . Compiled data from both ambient and elevated [CO 2 ] plots at SoyFACE may provide additional data to test that hypothesis, and to identify parts of the PID algorithm that could be altered to improve fumigation accuracy and precision.

Code availability
The Matlab version used for this work is MATLAB R2022b. The custom Matlab code and functions used to generate several of the supplementary files associated with this work are described in the Matlab Files sub-section of the Data Records section. The Matlab code and functions are described in further detail in the 'Explanation' files, which are publicly accessible via the Illinois Data Bank 37 . The Matlab code and functions are contained in the 'Matlab Files' folder (File 7), and the underlying data set is contained in the 'SoyFACE 1-Minute Fumigation Data Files' folder (File 9).
The R statistical programming language and openair package are required in order to use the windRose function (Figs. 2, 3). The R version used for this work is Rx64 4.1.2, which is free for all users to download. The underlying data set is contained in the 'SoyFACE 1-Minute Fumigation Data Files' folder (File 9).